Fast spectral analysis for approximate nearest neighbor search
نویسندگان
چکیده
In large-scale machine learning, of central interest is the problem approximate nearest neighbor (ANN) search, where goal to query particular points that are close a given object under certain metric. this paper, we develop novel data-driven ANN search algorithm data structure learned by fast spectral technique based on s landmarks selected ridge leverage scores. We show with overwhelming probability, our returns $$(1+\epsilon /4)$$ -ANN for any approximation parameter $$\epsilon \in (0,1)$$ . A remarkable feature it computationally efficient. Specifically, learning k-length hash codes requires $$O((s^3+ns^2)\log n)$$ running time and $$O(d^2)$$ extra space, returning needs $$O(k\log time. The experimental results computer vision natural language understanding tasks demonstrate significant advantage compared state-of-the-art methods.
منابع مشابه
Fast Approximate Nearest-Neighbor Search with k-Nearest Neighbor Graph
We introduce a new nearest neighbor search algorithm. The algorithm builds a nearest neighbor graph in an offline phase and when queried with a new point, performs hill-climbing starting from a randomly sampled node of the graph. We provide theoretical guarantees for the accuracy and the computational complexity and empirically show the effectiveness of this algorithm.
متن کاملComposite Quantization for Approximate Nearest Neighbor Search
This paper presents a novel compact coding approach, composite quantization, for approximate nearest neighbor search. The idea is to use the composition of several elements selected from the dictionaries to accurately approximate a vector and to represent the vector by a short code composed of the indices of the selected elements. To efficiently compute the approximate distance of a query to a ...
متن کاملApproximate Nearest Neighbor Search in $\ell_p$
We present a new locality sensitive hashing (LSH) algorithm for c-approximate nearest neighbor search in lp with 1 < p < 2. For a database of n points in lp, we achieve O(dn ) query time and O(dn + n) space, where ρ ≤ O((ln c)/c). This improves upon the previous best upper bound ρ ≤ 1/c by Datar et al. (SOCG 2004), and is close to the lower bound ρ ≥ 1/c by O’Donnell, Wu and Zhou (ITCS 2011). T...
متن کاملHardness of Approximate Nearest Neighbor Search
We prove conditional near-quadratic running time lower bounds for approximate Bichromatic Closest Pair with Euclidean, Manhattan, Hamming, or edit distance. Specifically, unless the Strong Exponential Time Hypothesis (SETH) is false, for every δ > 0 there exists a constant ε > 0 such that computing a (1 + ε)-approximation to the Bichromatic Closest Pair requires Ω (
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Machine Learning
سال: 2022
ISSN: ['0885-6125', '1573-0565']
DOI: https://doi.org/10.1007/s10994-021-06124-1